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Abstract 

In default of prior probability the parameter is not denned as a random variable, hence 
there can be no genuine prior-free parametric inference. Nevertheless prior-free predic- 
tive inference regarding any future data is generated directly from the datum of a location 
measurement. Such inference turns out as if obtained from a certain pdf ("fiducial") in- 
directly associated with the parameter. This false pdf can expedite predictive inference, 
but is inappropriate in the analysis of combined measurements (unless they all are location 
measurements of the same parameter). Also it has the same distribution as the ostensible 
Bayesian posterior from a uniform "prior". However, if any of these spurious entities is 
admitted in the analysis, inconsistent results follow. When we combine measurements, we 
find that the quantisation errors, inevitable in data recording, must be taken into consider- 
ation. These errors cannot be folded into predictive inference in an exact sense; that is, we 
cannot render a predictive distribution of a future datum except as an approximation. 

Keywords: location measurement; combination of observations; parametric inference; pre- 
dictive inference; prior-free inference; quantisation error; digitisation; frequentist interpre- 
tation; pivotal inference; the fiducial argument; fiducial probability; intuitive assessment; 
prior-free assessment. 

1 Introduction 

This work aims at a critical foundation of "pivotal inference", with a generalisation to combined location 
measurements. The issue is not in the application, because the solution is obvious anyway, at least 
in form, from direct intuition; the issue is in stating this solution in non-problematic terms, within 
the calculus of probabilities, without any supplementary principle. Addressing the general problem of 
prior-free parametric inference hinges on our understanding this plain situation. 

If in some measurement the error is understood as a random effect influenced only by factors outside 
the scope of this measurement, we call it a "location" measurement. For example when we measure the 
angular distance of two stars in a clear sky, the probability density function of the error depends on the 
properties of the instrument but is regarded as causally independent of either the true angular distance 
or of the resulting datum. 

If there is prior probability associated with the location parameter we apply B ayes' Theorem to 
generate statistical inference. The present work deals mainly with cases in which there is no prior 
probability. If we do not know in what way the two stars in the example have been picked, then we have 
such a case. 

*anakreon@hol . gr 
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Intuitive inteipretation of the result of a location measurement is quite plain. If the error is known 
to follow, for example, a normal distribution, intuition says that the information we gain about the 
measured parameter is expressed by a normal distribution which has the same variance. Yet against this 
plausible claim a formal objection is raised, that either there is prior probability or the parameter is not 
a random variable; if the parameter is converted into a random variable in the course of the analysis, 
some paradoxical (at least) consequences follow. At any rate, let us call this interpretation "intuitive" or 
"prior-free" parametric "assessment" (reserving the term 'inference' for procedures defined only within 
the calculus of probabilities). It is not distinct from so-called pivotal inference or the fiducial argument. 

Related theoretical discussions often involve such terms that do not provide a clear correspondence 
to an experimentalist's intuition. Moreover the logical standing of these probability distributions has 
been challenged with regard to certain paradoxes. To avoid logical inconsistency this work begins 
with a careful analysis of the location measurement, without any direct reference to specific statistical 
foundations, like objective Bayesian, fiducial, structural, or pivotal. 

However, with the kind of measurements considered here, we shall encounter no difficulty in de- 
riving predictive inference regarding any future datum that is related to the same parameter, without 
resorting to parametric inference. That is, knowing the datum x \ of the location measurement and the 
model Prx(x|6) of any future measurement, we obtain directly Prx(jc|jfi). Then it turns out that intuition 
is vindicated somehow, because predictive inference seems as if generated from the density function of 
intuitive assessment. Yet the latter is only a useful formula, not genuine probability pertaining to the 
measured parameter. In particular this supposed pdf is not meant to be the prior in a Bayesian analysis of 
the second measurement - this and related issues will occupy us in Sec. 12.1 .51 - though such presumption 
is harmless when we combine location measurements of the same parameter. Let us leave for another 
work the question whether predictive inference can in general be formulated as if from coherent para- 
metric inference, that is, even when it is based neither on a location measurement nor on combination 
of location measurements of the same parameter. 

If one assumes an unnormalised uniform density as prior probability for the parameter (that is, an 
unwarranted and possibly improper flat "prior") the related posterior pdf coincides with the supposed pdf 
of intuitive assessment. Moreover a remarkable multitude of proposed statistical foundations provide 
the same result to the problem of location measurement (regarding those, see d and [2|). In contrast to 
them the present work does not introduce any special axiom beside the usual postulates of probability. 
For that matter, one need not abandon the frequentist interpretation. 

2 A single location measurement 

2.1 Preliminary analysis 
2.1.1 Basic considerations 

Definition. If a measurement provides a single datum x for estimating a scalar parameter 6, the terms 
'location parameter' and 'location measurement' mean that, in this parametrization, the measurement 
error, e = x—Q, is additive noise of known pdf Pier's ) = f(e), where '$ ' stands for the background ev- 
idence or assumptions that establish this property. Random variable E is regarded as causally unrelated 
to anything else in this model. The symbol 6 represents any conceivable value of the parameter, not just 
the unknown true value. Each time we simulate the measurement for testing purposes, the parameter 6 
is assigned some arbitrary value. 

We assume that our first measurement is such a measurement, on the basis of assumptions $ i . With 
reference to any particular value (real or hypothetical) of the parameter, before the measurement the first 
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datum corresponds to a random variable: 

Pr^i^e) =Pr jri (* 1 -e|Bi,e) =/i(*i-e). (i) 

(Density functions are represented by functions in the stated parametrization.) If the parameter is un- 
known, then this pdf is unavailable. 

2.1.2 Bayesian inference for the parameter and for the error 

It may happen that, independently of the result of the measurement, we also possess some information 
H leading to the definition of a random variable & corresponding to the parameter, so that Pr@(8|#) 
= 7l(6). (For example, suppose that the true value of the parameter is a random sample drawn from a 
known population.) Then, using that pdf as the prior, Bayes' Theorem furnishes the posterior pdf of 0: 

Pr®(6|# oc 7t(0)/i(jCi-0). 

From the combination of Pro(6|:?/) withEq.^ the pre-measurement joint pdf of & andX is defined, 
7t(6) /i(xi— 6). The marginal pdf of E\ (=X\—&) is of course. After the measurement, e\ 

corresponds to the random variable x\— &. Therefore the pdf of E\ after the measurement is 

Pr £l (ei|^,®i,jci) =Pr@(jci-ei|^,«i,JCi)oc7i(jci-ei)/i(ei). (2) 

Note that this updated pdf of E\ can also be viewed as the posterior of an alternate Bayesian esti- 
mation, concerning E\ rather than 0. An instrument maker is primarily interested in the error of this 
measurement. Based on trials or on currently accepted theory, he has established a prior pdfPrE { {e \ \ "B \ ) 
= fi(ei) about the error, and is ready to update this pdf (with regard to the particular instance only) 
when he obtains datum x\ and prior 7i(6) for &. The likelihood of e\ would be Vxx x {x\\9{ ,e\) = 
Pre (xi — e 1 1 H , e \ ) =%(x\— e{). The resulting posterior for the error would be <=c Jl(xi — e i ) /i (e i ) , the 
same as that obtained in Eq.|2] The post-measurement pdf of the error has been obtained in two different 
ways to introduce an argument in the following section. 

Frequentist interpretation of the posterior probability of the error (or any variable) requires embed- 
ding the actual situation in the most general ensemble of imaginary experiments that can be conditioned 
by the same particulars. In our case, the conditioning consists of datum x\ in conjunction with prior 
information 9{ . Far from being ad hoc, this embedding schema is implicit in common and indispensable 
uses of probability. For example, after a coin is tossed but before the outcome is seen, we still associate 
a probability 1/2 with 'head', though the outcome is no longer random but only unknown. Therefore in 
such cases we must allow the use of probability to describe one's state of knowledge; in particular, to 
weigh forecasts about a fixed unknown quantity, if in some sense it has been drawn from an ensemble. 
In certain other cases, for example if the unknown quantity is supposed to be a constant of nature, the 
notion of probability would be problematic. 

2.1.3 What inference is possible in the absence of prior probability 

Under the assumption that the parameter is a random variable, Sec. l2.1.2l contains two equivalent ways 
of reckoning the posterior probability of the error. Based on the rationale of the second derivation, we 
can form the following argument. Suppose that there is no prior probability associated with 6; then the 
instrument maker cannot apply Bayes' Theorem to update his prior for the error; therefore the pdf of E\ 
after the measurement will remain f\ (e \ ). 

The ensemble of imaginary experiments corresponding to a prior-less location measurement in- 
volves all possible values of the parameter, though with an indeterminate distribution, because the pa- 
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rameter is not a random variable. Without a definite distribution of values for the parameter, conditioning 
this ensemble on the datum alone cannot update the pdf of the error. 1 

Caution must be exercised in interpreting random variable x\ — E\, after the measurement. (For that 
matter, before the measurement x \ neither is constant nor corresponds to a random variable, except only 
in relation to any fixed hypothetical value of the parameter; that is, before the measurement there is 
no available definite distribution associated with the datum.) It is tempting to identify x\—E\ with the 
parameter itself, since its unknown true value is equal to the unknown true value of the parameter. 2 Yet 
this premise would lead to paradoxical, at least, conclusions (as in Sec. 12.1.51 : it is not adopted here. 
The parameter remains an unknown constant only, not converted into a random variable. 

Now we come to the problem of predictive inference. In general the second measurement is modeled 

as 

Pr x {x\M ,6) =p(x;Q) (3) 

that is, x|5W ,8 corresponds to a random variable. 

We can also match x\M ,x\,e\ to a random variable, because each possible value e corresponds to 
6 = xi— e\\ 

Pr x (x\!M ,xi,ei) = p(x;xi-ei) (4) 
so that, in conjunction with the pdf of E\, joint pdf of X and E\ is defined: 

Vrx^ x {x,e\\(B\,M ,x\) = p(x;x\-e\) f\(e\) . (5) 

Predictive inference for X amounts to the marginal pdf of this variable, 3 

Pr x (jc|!Bi,5W,xi) = j ' dep{x,x\-ei)f\(e\). (6) 

We have obtained prior-free predictive inference about any future measurement directly, thus by- 
passing the problem of prior-free parametric inference. Not only the assumption of a pdf for the pa- 
rameter is not required, but it would annul this derivation. Of course we depend on the assumption that 
there is a single, constant true value of the parameter, otherwise the substitution of 6 (in the model of 
the second measurement) withxi— e\ would not be justified. 

The parameter itself is not a random variable, yet there is a remarkable coincidence. If we only 
pretend that in the future measurement the parameter is a random variable with the same pdf as x\—E\ 
the resulting predictive inference for X will turn out the same as in Eq. |6] Therefore there is utility in 
communicating the pdf of x\—E\, because it happens to generate correct predictive inference. For that 

'We can represent the unconditioned ensemble by means of a sequence of actual simulated measure- 
ments, each starting with some arbitrary 6,-, provided that the distribution of these values is understood 
as irrelevant. Then conditioning on the datum alone (not on the presumed true value of the parameter, 
which is inaccessible) cannot modify the distribution of the error, which remains 

2 Before the measurement, the expression x\—E\ is equivalent to 6, but the meaning changes after 
the measurement: x\ is restricted to the actual result but E\ is still understood with reference to the 
ensemble of all measurements. 

3 If we take into account that we never know x\ directly except by means of some digital transcript 
y i , w e sh all b e interested in conditioning by y\ rather than by x\. This problem is addressed in Sections 
EnDandO 
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matter, only in this capacity our intuitive parametric assessment can acquire a proper definition. 4 

Pr {xi _ El) (Q\'B 1 ,x 1 ) =Pr El (*i-e|!Bi,*i) =/i(*i-9). (7) 

The same result, in form, has been obtained also by means of applying fiducial, pivotal, structural, or 
objective Bayesian "inference", but in this work we do not consider any kind of inference not based on 
the calculus of probabilities exclusively. 

The density function of intuitive parametric assessment (that is, the pdf of x\—E\) has been defined 
without reference to what the parameter stands for; it represents only whatever the apparatus has mea- 
sured. It makes no difference whether the true value of the parameter is understood to be, for example, 
a constant of nature or some quasi-constant magnitude (like the radius of a star). The pdf is not o/that 
magnitude but only related to it, as an accessory of predictive inference. Therefore this pdf is not in 
conflict with the frequentist interpretation of probability. 

2.1.4 Essential distinctions and caveats 

Sometimes the issue of prior-free assessment is confused due to considerations assuming a joint pdf of 
and X\. For that matter, it cannot be denied that the existence of some pdf related to the parameter 
after the measurement, along with the family of conditional pdfs Pr^, (x\ | 6), defines some joint pdf yet 
this pdf is not of & and X\ but of x i — E i and X2, where X2 represents the datum of a hypothetical second 
iid measurement. Besides, if one assumed a joint pdf of some & (defined independently) and X\, then 
the Bayesian method would be applicable; the marginal pdf of that & would be the prior. 

If one assumes a prior uniform in the location parameter (we have not done so) the posterior will be 
represented by the same density function as that of prior-free assessment, though the two functions are 
different in intent. In this work, if a prior happens to generate the same posterior as the pdf of prior-free 
assessment, it will be called the "false prior" of the measurement to emphasise that (like a false ceiling 
or false teeth) it has a function (it generates the same result as prior-free assessment) but is not the real 
thing. 5 In any specific problem, existence of a false prior would simply mean that prior-free prediction 
is equivalent to prediction based on that prior (as it happens; not because Bayes' Theorem applies). 
Non-existence of a false prior would imply either that predictive inference cannot be reduced to any 
presumptive pdf of the parameter, or a situation in which the ratio of that pdf to the likelihood depends 
not only on 6 but also on x. 

The case of a location measurement without prior probability is so intuitively transparent that it 
is inconceivable for any proposed statistical foundation to provide any solution other than a density 
function of the same profile as that of Eq. that is, equivalent to the posterior obtained from a uniform 
false prior. "Virtually all default Bayesian methods recommend this conditional prior, as do various 
"structural" and even frequentist approaches" (|2 1, Sec. 4.3, referring to a more general case). Therefore, 
in the case of a location measurement at least, it seems legitimate for an experimenter to follow plain 
intuition and consider the option of no prior. Of course we must examine in what circumstances this 

4 Fisher has remarked that the problem of fiducial predictive inference based on datum x\ can be 
solved "directly", not only "after the [...] distribution of the population parameter [...] has been obtained" 
([3], Sec. II), "without discussing the possible values of the parameter 6" ([4|, Sec. V.3). Yet he defines 
fiducial prediction only as derived from fiducial probability of the parameter values; consequently the 
simplification he mentions is only a secondary issue. In the present work predictive inference is defined 
in the absence of any distribution for 6; therefore the possibility to also calculate it as if from some 
intuitive density function of the parameter is regarded as fortuitous, proved in the case of location 
measurements but not yet guaranteed to be generally true. 

5 That is, a false prior for 6 is not meant to be combined with the parametric family Pr^ (x| 8) to produce 
a joint pdf of & and X. The only use of a density that is known to be a false prior is to be multiplied with 
the likelihood, Prx(x|6), in order to calculate the density function of intuitive assessment. 
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option is appropriate; on the other hand, we must also assess when prior probability can be asserted on 
the basis of educated guessing only. These questions are left for another work. 

2.1.5 Can Bayesian estimation be combined with prior-free assessment? 

If someone regards the pdf of prior-free assessment as converting the parameter into a random variable, 
then in the analysis of a second measurement Bayes' Theorem would be applicable, with this /^provid- 
ing prior probability. It is not surprising that this consideration leads to inconsistency in the results. For 
instance, if we combine two location measurements regarding not the same parameter but two functions 
of the same parameter, the posterior pdf depends on the order of analysing the two measurements. 

Yet even if we reckon strictly within the limits of predictive inference, the same predicament seems 
to arise. When the second datum, x, is obtained, Eq. |4] provides the likelihood of any value e i of 
the error; the likelihood can be combined with the prior pdf, f\{e\), to form the posterior pdf of E\. 
If this is correct, then new pdf of x \ —E i will be obtained, which in turn may be used for predictive 
inference regarding a third datum, and so on. In effect random variable x\—E\, with updated pdf after 
each measurement, would act as a surrogate random variable for the parameter, even after we combine 
two or more location measurements. Yet in this way the parameter turns effectively into that random 
variable, in spite of the initial assumption. If this concern seems remote from practical applications, let 
us consider the combination of two location measurements regarding not the same parameter but two 
functions of the same parameter, like the diameter and the volume of a sphere. If we update the pdf of 
the error in the way suggested above, then predictive inference concerning any future third measurement 
depends on the order of combining the two data (the method is equivalent to admitting the false prior of 
the first measurement). Unless we suppose that information may have some metaphysical consequence, 
we cannot accept that predictive inference about any third datum may depend on the order of receiving 
the first two data. Almost certainly there is some error in the argument, and we must find it (otherwise 
this inconsistency would be credited to the common postulates of probability calculus). 

2.1.6 Predictive inference is known only as an approximation 

A careful review locates the fault in overlooking an approximation attendant to employing Eq. |4] The 
"analog" datum x \ is registered as a numerical record y i , using a scale of step Pi , so that a quantisation 
error r\ (= y\—x\), however small, is unavoidable, constrained in the range [— P\/2,P\/2). Therefore 
corresponding to any conceivable value e\ (= x\— 0) there is no single value 6, but rather some small 
range of values. 

Nevertheless the pursuit of predictive inference is not impaired thereby, except that we end up with 
an approximation of the unknown true density function. Since x\ is strictly speaking inaccessible, Eq.|6] 
cannot be our goal. We can redefine predictive inference, to be conditioned by y i and r i instead of x i : 

Pr x (x|!8i,3tf ,vi,ri) = J de l p(x;yi-r l -ei)f l (ei). (8) 

Inasmuch as r\ cannot be known, we have not yet produced exact predictive inference. We would like 
to define "Prx(jc|s i , fW ,yi)" but, as we shall see in Sec. 12.21 we cannot apply any exact meaning to 
this expression in the absence of prior probability. Nevertheless for small enough P\ (we are interested 
only in the limit of vanishing Pi because in general we want to keep round-off errors so small as to be 
insignificant) and assuming smoothness of fi(-) and /?(•;•), the true but unknown Prx(jc|Si ,M ,yi,ri) is 
uniformly approximated by the following density function (the limiting operation assumes an imagined 
sequence with Pi going to 0; the corresponding values r\, in {—P\/2,P\/2), are of no consequence in 



6 



defining this limit) 



lim Pr x {x\$i,M ,yi,r!) = J de\p(x,y\-e\)f\(e\), (9) 

not dependent on r\. In practice it takes the place of the non-existent "Prx(*|® j , <M ,yi)". 6 

Yet in view of this complication the troubling Bayesian argument in Sec. I2.1.51 must be replaced 
by a more detailed examination, as in the following sections; the perplexing inconsistency that it had 
generated disappears, only at the cost of our having to reconsider the method of updating the pdf of e\. 
For that matter, we shall see that r\ does not correspond to any exactly defined pdf, therefore there is no 
exactly defined updated distribution for e\, unless it is conditioned by the unknown r\. 

2.2 Quantisation error in a single location measurement without prior 

Even in the plainest measurement, every time a numerical record is kept a quantisation error is gener- 
ated, also known as "digitisation" error. We are justified to neglect it in the case of a single location 
measurement, because it is possible to define the pdf of prior-free assessment with good approximation 
without reference to this error. Yet combinations of measurements cannot be studied if the quantisation 
error is neglected. To prepare that study, we include now the quantisation error even in the analysis of a 
single location measurement. 

For simplicity, let us assume that the designer takes sufficient care so that digitisation does not 
involve any error beside the quantisation error, or rather that such errors are negligible in comparison 
with the principal "analog" error, E. The quantisation error remains for ever unknown (unless we 
preserve and digitise again the same x) unlike the total error, e+r, which can be estimated to any desired 
precision, with subsequent measurements. 

Note that need for consideration of the quantisation error does not arise only in prior-free inference. 
Without it, even the model problem of Bayesian inference with continuous sample space would be 
impossible to solve because the probability of any real datum x is zero. Actually the likelihood of 6 is 
defined not for some datum x but for a range of x between y—P/2 and y+P/2. The usual underlying 
assumption is that the assessment of P does not involve 6. It is not required that P be very small; 
smallness of P only simplifies the calculations. 

Though it is commonly assumed that if P is small enough then r is distributed approximately uni- 
formly over its range, here we must examine why and in what sense this is valid. 

We must clarify in what sense we assign probability density to r\ after the measurement. Typically 
the quantisation error of a digitisation remains unknown, but for the sake of this argument now we shall 
imagine that we measure it. Let us assume a second digitisation of the same analog quantity x \ , with a 
much smaller quantisation step, so that we can estimate r\. The "true" distribution of r\ is understood 
in reference to an ensemble of iid measurements of the same parameter, each followed by the two 
digitisations, the first of which happens to produce the same y i (else it is excluded from this ensemble). 

Thus specified, the distribution of ri is defined in terms of the distribution of jci|fBi,6 which is 
f\(x\— 0), from Eq. (That is, the distribution of r\ in [— P\/2,P\/2) corresponds to a normalised 
slice of /"i (jci — 0), with^i between y\—P\/2 and y\+P\/2 - see Fig.^) 

^(rilBi.e.yiJoc/i^-n-e). (10) 

6 Of course digitisation steps cannot be made arbitrarily small; on the contrary, in any set of measure- 
ments, Pi are fixed, finite magnitudes. The limiting operation is intended to justify the approximation of 
a true but unavailable pdf by a known density function. 
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Figure 1 : The result of a digitisation corresponds to an interval centered at y i , of width Pi . The "true" 
distribution of quantisation error r\ (imagining an ensemble of measurements starting with and ending 
with y 1) is « /i (j i — r\— 8). Inasmuch as 6 is unknown and not a r. v., this distribution is indefinite. Yet 
in some sense it can be approximated by a rectangular distribution, regardless of 6, though at the cost of 
excluding from the analysis all 6 whose likelihood is smaller than some threshold. 



However this definition alone cannot amount to any determinate pdf being assigned to r\, because the 
true 8 is unknown. 7 Operative meaning is generated by the following considerations. 

Approximating this unknown distribution by a rectangular one would be justified if we could prove 
that, when P\ approaches 0, the distribution of r\/P\+\/2 (in [0, 1)) converges uniformly to 1 for all 
8. Whether this is true depends on f\(-). Even in the case of a normal distribution the premise is not 
true: considering any Pi -wide slice of a normal distribution, the log of the ratio of left-to-right heights 
is proportional to the distance from the mean; therefore it goes to ±oo for 6 — > ±°°. 

With a normal f\ (•), the only way to contrive this uniform convergence is by excluding from consid- 
eration all 6 whose likelihood {f\ (y i —6)) is less than some arbitrary small threshold. With this provision 
not only normal distributions but also any smooth fi(-) with bounded derivatives can be made to fulfil 
the condition of uniform convergence. Detailed proof is rather straightforward. In this way we have 
defined a provisional approximation of the post-measurement pdf of r\ by the rectangular distribution. 

Perhaps a summary is helpful here. To advance our main undertaking, accurate combination of 
location measurements without prior probability, we have seen the necessity of taking into account 
quantisation errors. These errors are commonly associated approximately with rectangular distributions 
if the step is small enough. Such association would be justified only if the true distribution converges 
uniformly to the rectangular shape as Pi goes to 0. Yet for the post-measurement pdf of r\ this is not 
true in general. To justify approximating this pdf 'by a rectangular distribution not only we assume a 
small-enough Pi but also we provisionally exclude all 6 whose likelihood is smaller than some arbitrary 
threshold. 

This provision amounts to setting an arbitrary high confidence level 1 — £ to generate a (very wide) 
confidence interval, or group of intervals, for 6, using likelihood to provide the requisite ordering. Then 
the premise of uniform convergence is at the same confidence level. Under this provision, the quantisa- 

7 If there is a prior for 6, we usually are not concerend with this true but elusive pdf because then 8 
is a random sample from a population, so that we can fold the posterior pdf of & with f\ (y \ —r\ —6) to 
produce the posterior pdf associated with r\, whether Pi is small or not. The approximation of this pdf 
by a rectangular one for small Pi is warranted on the basis of smoothness and bounded derivatives. 
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tion error corresponds approximately to a random variable R \ of rectangular pdf in \—P\/2,P\/2), as if 
independent of E and without dependence upon 6. 

The pre-measurement pdf of r\ is an average over all y\ (that is, 0, ±P, ±2P, ...) with reference to 

6: 

Pr J!l (ri|i8,e)=^ i Pty 1 (yi|«i,e)Pr J | 1 (r 1 |i8 lj e,yi) = J^ i /i(yi-ri-e). (11) 

It depends on 6 in a periodic way, with period Pi . Like the related post-measurement pdf it cannot 
be known exactly because 6 is unknown and not a random variable. Uniform approximation by a 
rectangular distribution can be derived from general assumptions of smoothness in combination with 
Eq. without need for any special provision. 

In conclusion, in the limit of small P\ , the quantisation error corresponds approximately to a random 
variable R i of rectangular distribution, the same before and after the measurement, if we impose on 
some threshold (however small) of likelihood. 

Since we have established some meaning for associating r\, approximately, with a random variable 
R i of rectangular distribution, now we may fold this pdf of R \ with Eq. [8] to provide approximate 
meaning to "Prx(x|®i,iW ,yi)". Yet this would not be approximation of a defined if unknown pdf 'but 
only taking advantage of approximations to define a density function that is not otherwise meaningful 
(inasmuch as there is no prior for 6 - see n.0. Well defined predictive inference is provided by the true 
though not exactly known Pr^ (x\ & \ , M ,y i , r \ ), approximated in Eq. [9] 

Rather than being regrettable, the lack of such predictive inference that would be both accessible and 
exactly defined is our way out of the inconsistency mentioned in Sec. 12.1 .51 For that matter, uncertainty 
is the very subject of probability calculus, so nothing is lost if it is combined with approximation. 

The considerations outlined in Sec. 12. 1 .31 regarding the utility of the density function of prior-free 
assessment - now f\ (y i —9) rather than f\ (jci — 6) - have not been affected. This density function does 
not express probability density of 6 but if used in that sense it provides predictive inference, within 
approximation depending on the size of the digitisation step. 



3 Combining two location measurements 
3.1 Statement of the problem 

So far we have considered one completed location measurement in relation to a future generic measure- 
ment. Now we consider two possibly different completed location measurements of the same parameter, 
in relation to a future generic measurement. That is, the two errors 
e\ = jci — and £2 = X2—Q 

follow true pdfs independently of any other variable and of each other: 

Pr £l (<?i|!8i) = /i(«i) and Pr El (e 2^2) = fi(e 2) ■ 

Therefore, before the data become known, the joint pdf of the two errors is 

Pt Eu E 2 (ei,e 2 \{Vi}) =fi(e 1 )f 2 (e 2 ). (12) 

When the two data x \ and X2 are taken into account, e 1 and e2 are jointly constrained by the condi- 
tion 

e\ —e2=x\ —X2 (13) 
which defines a straight line / in the (^1,^2) plane, parametrized by 6: 

e\ = ^1—9 

e 2 = x 2 -Q. (14) 
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Therefore we need define relative probability for segments of this line. But such probability may be 
defined only with reference to some limit, because joint probability vanishes for line segments. 

However, if we compare two segments of / of equal length, chosen so that, at all points of the first 
segment, the joint pdf is higher than at every point of the other segment, we intuitively perceive that the 
probability of the first segment is higher than that of the second one. For an expanation why this is so, 
we must take into account the errors generated in the digitisation of data. 

3.2 Taking into account the quantisation errors 

In terms of the digitised outcomes y; (i = 1,2) (these are the ones actually issued) Eq.s[T4lbecome 

e\ = yi-n-Q 

e 2 = y 2 -r 2 -Q. (15) 

By eliminating 6 we obtain 

e\ -e 2 =y\ -yi + ri-ri (16) 

instead of Eq. [O] Since each r, is constrained within [— P,/2,P,/2), the above equation defines a strip 
in the (ei,e%) plane. 

It will be convenient to transform the pair of random variables (E\,E2) into (£+,£_): 



E + = (Ei+E 2 )/V2 

E = (E 2 -E 1 )/V2. (17) 

This is equivalent to a ji/4 rotation of coordinate axes. 

The related Jacobian determinant is 1, therefore the pre-measurement joint pdf of E + and E is 

Pr £+>£ _ (e+,e-\{<Bi}) = M(e--e+)/y/2) f 2 ((e^+e+)/V2) . (18) 

A corresponding rotation of axes in the the (r\,r 2 ) plane takes us from (r\,r 2 ) to (r+,r_) (as in 
Fig.©: 



r+ = (ri+r 2 )/V2 

r _ = (r 2 - ri )/V2. (19) 



In terms of the new variables, Eq.slT5lbecome 

e+ = (y 2 +yi)/V2-QV2-r + (20) 
e~ = {yi-y\)/y/2-T- . (21) 

We can form an argument similar to that in Sec. l2.1.6l First we must establish the pdf pertaining to 
E + after the measurement. This pdf will be conditioned not only by y i and y 2 but also by the unknown 
r + ; that is, Pr£ + (e + |{® ! },{y ! },r + ). Each possible r + corresponds to a certain range of possible values 
for r_, say [r_ )0 ,r_ j) (as marked on Fig. |3>; from Eq.|^we obtain a corresponding range of values for 
e_; this range defines a strip in the plane; the marginal pdf of e + in this strip (extracted from 

the j oint pdf of Eq. IT2l is the pdf of E + with reference to y \ , y 2 , and r + . 

In defining this pdf we have not required that P,- be small but, since this assumption will be needed 
soon, we simplify the equations by introducing it from the beginning. Therefore the strip in the plane 
(e+ , e_ ) will be reckoned as approximately of infinitesimal width, with e- w 0. In the following equa- 
tion the limiting operation regards an imagined sequence of pairs (P\,P 2 ) going to (0,0) with a constant 
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Figure 2: Quantisation errors of two measurements 



or bounded ratio. The corresponding values r + , each in —(P1+P2) / \ 2y/2J , (P1+P2) / \2\/2\ ), 
make no difference in defining this limit: 

lim Pr £+ (, + |{ S; .},{j,.},r + ) - /t ) /2 (^_Zi+f+ ) . (22 ) 

which serves in approximating the pdf of e+ up to a normalisation constant. 

Now we can formulate predictive inference regarding x, the datum of a future generic measurement, 
which is modeled as in Eq.|3] For any tetrad of tentative values y\, y 2 , e + , and r + , using Eq.|20]we find 
the corresponding value of the parameter, 



B=(y 2 +yi)/2-e + /V2-r + /V2. (23) 
In reference to the the tetrad y\,y 2 ,e + , and r + the future datum corresponds to a random variable: 

Pr x (x\{V i },{y i },e + ,r + )=p(x-,({y 2 +y l )/2-e + /V2-r + /V2)) . (24) 



yi and y 2 are known; conditioned by r + , a pdf for e + is defined and approximated in Eq.[22j assum- 
ing that P, are small. Therefore we define joint pdf of X and E + conditioned by y\, y 2 , and r + , which 
we express here in the limit of vanishing P,, 

lim Pr x , E+ (x, e+ \ { <B ,• } , M , {y ,■ } , r + ) 

, , yi-yv e + , yi-y\ e + y 2 +y\ e+ 

Predictive inference for X amounts to the marginal pdf of X; in the limit of negligible P, 

lim fr*(x|{!8i},ar,{yi},r + ) 

, f f y2-yi e+ y 2 -yi e+ y 2 +yi e + 
de + fx (— +^) P<* — ) • < 26 > 

In this limit there is no dependence upon r + . 
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As in the case of a single location measurement, also from the combination of two location mea- 
surements we have formulated predictive inference conditioned on a function of the digitisation errors 
with the understanding that, whatever that magnitude happens to be, the corresponding true pdf of X 
would not deviate to a large extent from the density in Eq. |26] 

The same density function of X can be obtained if we pretend that in the future measurement the 
parameter is a random variable with pdf f\{y\— 0) fiiyi — 8)- So this density function represents our 
prior-free assessment of 8 based on the two location meassurements combined. It is the posterior arising 
from a false prior uniform in 6. 

In terms of the discussion in Sec. 13. II (that is, disregarding the digitisation error) we have defined 
probability density along line /, defined by Eq. El on the [E\,E*i) plane, simply as proportional to the 
pre-measurement joint pdf of E\ and E%. 

4 Discussion 

Location measurements of the same parameter are very easy to interpret intuitively, so that the arguments 
in this work may be regarded as belabouring the obvious. The reason for the systematic treatment is 
the need for preparing the concepts pertaining to the study of generic measurements, even counting 
measurements, perhaps also involving more than one parameters. That development will be presented 
in another work. 

We have examined in detail combinations of only two location measurements of the same parameter. 
Generalisation to any number N of such measurements is straightforward. After the measurements, the 
Af-dimensional error will be constrained close to a straight line by ,/V— 1 ancillary conditions similar 
to Eq. |^ In analogy to variables £_ and E + of the two-measurement case, ,/V new variables: E-\, 
. ..E r N _ ! \ , and E + are introduced. In the same manner, r_ 1 , . . . r_ \ \ , r + are defined. The result is, in 
the approximation that assumes very small digitisation steps, that predictive inference is approximately 
as if from a flat false prior. 

Here is an idea for further research. The simplicity of the result, as expressed in the last paragraph of 
Sec. 13.21 perhaps indicates the existence of a proof more general than the one presented in that section. 
Such a proof would be advantageous if it could be generalised to address the combination of location 
measurements regarding not the same parameter but functions of the same parameter. For example, 
combining a measurement of a side of a cube with a measurement of the area of the cube and with a 
measurement of the volume of the cube. After the measurements, the Af-dimensional error is retricted 
on a curve (if we disregard digitisation errors) parametrized by 8. An obvious conjecture is that the pdf 
along this curve will be proportional to the pre-measurement A^-dimensional joint pdf at the same point. 
If we could prove it, then also Jeffreys' prior would be shown to generate the same results. 

5 Conclusions 

Applying only the common postulates of probability, without any specific assumptions about statistical 
inference, from single or combined location measurements of the same parameter we can obtain prior- 
free predictive inference. 

The same density function would be obtained if we started with a uniform prior for 6. Yet the 
logical standing of that Bayesian posterior is distinct from prior-free assessment. Moreover, we have 
not established either that in any generic situation predictive inference may be reduced to a presumptive 
pdf for the parameter, or that such density function is the posterior from some false prior. 

To be able to combine measurements, we must take into account the inevitable digitisation errors in 
data recording, however small. In the absence of prior probability for the parameter, there are no exactly 
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defined pdfs for such errors. Therefore we cannot just fold them into predictive inference; that is, the 
true pdf of predictive inference is unknown but can be approximated by an available density function. 
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